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Phenotypic analysis of gene-specific knockout (KO) mice has 
revolutionized our understanding of in vivo gene functions. As 
the use of mouse embryonic stem (ES) cells is inevitable for 
conventional gene targeting, the generation of knockout mice 
remains a very time-consuming and expensive process. To 
accelerate the large-scale production and phenotype analyses of 
KO mice, international efforts have organized global consortia 
such as the International Knockout Mouse Consortium (IKMC) 
and International Mouse Phenotype Consortium (IMPQ, and 
they are persistently expanding the KO mouse catalogue that is 
publicly available for the researches studying specific genes of 
interests in vivo. However, new technologies, adopting 
zinc-finger nucleases (ZFNs) or Transcription Activator-Like 
Effector (TALE) Nucleases (TALENs) to edit the mouse genome, 
are now emerging as valuable and effective shortcuts alternative 
for the conventional gene targeting using ES cells. Here, we 
introduce the recent achievement of IKMC, and evaluate the 
significance of ZFN/TALEN technology in mouse genetics. 
[BMB Reports 2012; 45(12): 686-692] 



INTRODUCTION 

Based on both genetic and physiological similarities to hu- 
mans, the mouse has a long history of being used as an in- 
valuable model organism in biological and clinical studies (1, 
2). Furthermore, various genetic tools modifying the mouse ge- 
nome have enabled the researchers to study gene functions in 
vivo, and manipulation of the mouse genome through gene 
targeting in mouse embryonic stem (ES) cells has been central 
to mouse genetics over the past two decades. It is now gen- 
erally accepted that the phenotype analysis of genetically-en- 
gineered mouse (GEM) models has accelerated our under- 
standing of in vivo gene functions. Using GEM models, a ple- 
thora of human diseases (e.g., glioblastoma multiforme) has 
been reproduced and studied in mice (3). In appreciation of 
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the valuable contribution, the Nobel Prize in Physiology or 
Medicine 2007 was awarded jointly to Mario R. Capecchi, Sir 
Martin J. Evans and Oliver Smithies for their discoveries of 
principles of gene targeting in mice using ES cells. 

After completing the human genome project, GEM models 
have been recognized as essential materials for functional ge- 
nomics, and the demands for developing novel GEM models 
have significantly increased. However, gene targeting using 
mouse ES cells is still a time-consuming, expensive, and labori- 
ous process. To initiate breakthroughs, global consortia have 
been organized for the high throughput production and pheno- 
type analysis of GEM models. Furthermore, new technologies 
employing engineered nucleases are emerging and replacing 
the use of ES cells for genome editing in mice. Here, we in- 
troduce the recent achievement of the International Knockout 
Mouse Consortium (IKMC) and International Mouse Phenotype 
Consortium (IMPC), and evaluate the significance of engineered 
nucleases (e.g., ZFN and TALEN technologies) in mouse 
genetics. 

INTERNATIONAL EFFORTS FOR THE LARGE-SCALE 
PRODUCTION AND PHENOTYPE ANALYSES OF 
KNOCKOUT MICE 

The IKMC is an international consortium composed of Knock- 
out Mouse Project (KOMP, USA), European Conditional Mouse 
Mutagenesis Program (EUCOMM, Europe), North American 
Conditional Mouse Mutagenesis Project (NorCOMM, Canada), 
and Texas A&M Institute for Genomic Medicine (TIGM, USA). 
KOMP includes CSD, a collaborative team at the Children's 
Hospital Oakland Research Institute (CHORD, the Wellcome 
Trust Sanger Institute (VVTSI, UK), the University of California at 
Davis School of Veterinary Medicine, and Regeneron Pharma- 
ceuticals, Inc. (USA). 

The aim of IKMC is to mutate all protein-coding genes in the 
mouse by gene trapping and targeting using mouse embryonic 
stem (ES) cells. Historically, ES cell lines from 129 mouse 
strains are prevalently used to target genes in mice. However, 
as C57BL/6 (B6) mice are the most intensively studied mouse 
strain, there is plenty of data describing normative physi- 
ologies of B6 mice. Therefore, a laborious and time-consuming 
backcross had to be conducted to bring the targeted mutation 
into a pure B6 genetic background after KO mice were gen- 
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erated using ES cells derived from 129 strain. To this end, 
I KMC members established the utilization of ES cell lines with 
a pure B6 genetic background (4). Since the Agouti allele of 
B6 mice is recessive, tyros/nase-deficient B6 (thus albino) or 
Balb/c female mice are used to collect host embryos for chi- 
mera generation and to test germ-line transmission for coat 
color screening. Furthermore, they recently simplified this 
complex breeding scheme by restoring the dominant Agouti 
allele in B6 ES cells (5). This experimental setup provides us 
with gene targeted mice of a pure B6 genetic background that 
can be immediately utilized. The comprehensive phenotypes 
of GEM models are also being analyzed through the systematic 
platforms for mouse phenotyping under the International 
Mouse Phenotype Consortium (IMPC) (6). Furthermore, IKMC 
allows the public to freely access KO mouse resources and 
phenotyping data through the internet. Therefore, by generat- 
ing and phenotyping GEM models, these international efforts 
will make it possible to annotate every single gene function in 
mice in the near future. 

The gene targeting strategies with a conditional potential 

The gene targeting strategies by the IKMC can be categorized 
into three types. First, the TIGM has been employing the 
gene-trap strategy. They use retroviral gene trap vectors con- 
taining 5' selectable marker P-geo, a functional fusion between 
the (3-galactosidase and neomycin resistance genes, for identi- 
fication of successful gene trap events. Later, it can be also 
used for the reporter gene in vivo. 

VelociGene, another KOMP member generates definitive 
null alleles (7). In most cases, they delete all protein-coding ge- 
nomic loci [i.e., both exons and introns ranging from start co- 
don to stop codon). Since they use modified bacterial artificial 
chromosome (BAC) clones as a targeting construct, this strat- 
egy can be applied to all genes transcribed by RNA polymer- 
ase II, and does not need to be restricted by the size of a gene, 



intron-exon structure, RNA splicing pattern, or protein-coding 
capacity (7). 

The other IKMC members are archiving mutant ES cells and 
mice containing a versatile allele called the knockout first al- 
lele (Fig. 1). The initial allele (tmla) has a gene trapping cas- 
sette in an intron, and the critical exon(s) is flanked with LoxP 
sequences. After establishing a mutant mouse line, the gene 
trapping capacity can be immediately utilized. In addition, a 
lacZ gene contained in the promoter-less trapping cassette can 
be used as a reporter to monitor the gene expression in vivo. 
This gene trap cassette is flanked with Frt sequences, and thus 
can be removed by crossing mutant mice with transgenic mice 
expressing Flp recombinase in germ cells (tm1c) such as 
ACT:FLPe transgenic mice (8). Removal of the gene trap ele- 
ment restores normal gene activity. After eliminating the gene 
trap cassette, the floxed exon of the tm1c allele can be deleted 
using Cre recombinase (tm1d), which induces a frameshift mu- 
tation and subsequent nonsense mediated decay (NMD) of the 
mutant transcript (9). If both the trapping cassette and floxed 
exon of the tmla allele are deleted, this mouse line can only 
be used as a lacZ-tagged reporter mouse line (tmlb). 

As of October, 2012, ES cell clones with conditional alleles 
for 8,897 genes, deletion alleles for 3,622 genes, and gene 
trap alleles for 9,583 genes are generated by the IKMC, and a 
small number of mutant mice are also produced using these ES 
cell clones (http://www.knockoutmouse.org/about/geneprogres 
ssummary). The targeted ES cell lines and mutant mice can be 
easily searched at the IKMC web site (http://www.knock- 
outmouse.org/). 

The practical use of Cre recombinase-expressing mouse lines 

The conditional knockout mouse strategy is now prevailing as 
it has overwhelming advantages over classical knockout and 
gene trap strategies. For example, if a conditional allele is in- 
troduced to the mouse genome, germ-line mutation can be al- 
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Fig. 1. The basic IKMC knockout al- 
lele (tmla). The lacZ cassette has a 
5'-splicing acceptor. Flp removes both 
lacZ and neo cassettes (tm1c), and 
Cre deletes both the neo cassette and 
exon 2 (tmlb). Further action of Cre 
on tm1c allele removes the critical 
exon 2 (tm1d). Adopted from Skarnes 
et a/. (9). 
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so generated using Protamin-Cre transgenic mice (10). In addi- 
tion, germ-line mutations of essential genes frequently result in 
embryonic lethality, preventing the studies of gene functions 
in post-natal stages including adult mice. Thus spatio-temporal 
control of Cre recombinase can be employed to bypass the 
embryonic lethality. Furthermore, generating mouse models 
for human diseases that frequently occur sporadically in specif- 
ic tissues is a more reasonable approach. Therefore, the gen- 
eration of reliable Cre transgenic mouse lines may be central 
for the analysis of in vivo gene functions using conditional 
knockout mice. Diverse Cre transgenic mouse lines that were 
previously developed can be searched in the web site of 
Mouse Genome Informatics (MGl, http://www.informatics.jax. 
org/recombinase.shtml). 

Cre transgenic mice have been being evolved further by 
adopting state-of-the-art technologies for the conditional mod- 
ulation of Cre activity. ER T2 , a modified ligand-binding domain 
of the estrogen receptor, has been successfully used to condi- 
tionally translocate a fusion protein into the nucleus upon the 
binding of 4-hydroxytamoxifen (4-OHT) (11). When 4-OHT 
binds to Cre-ER T2 fusion protein, this liganded complex trans- 
locates into the nucleus and deletes the floxed region. 
Combined with tissue-specific promoters, Cre-ER T2 transgenic 
mice are widely used to regulate gene knockout in a spa- 
tio-temporal manner. 

Although Cre lines are the necessary reagents for generating 
their conditional mutations, their use is still limiting. Currently, 
a single satisfactory database that well integrates and updates 
all available Cre lines does not exist. Furthermore, similar Cre 
lines are often generated independently by disparate groups, 
but their reports do not always describe their detailed and crit- 
ical characteristics including the efficiency of recombination, 
cell and tissue specificity, and genetic background effects. To 
improve the current insufficiency of and accessibility to appro- 
priate Cre mouse lines, the CREATE (Coordination of resources 
for conditional expression of mutated mouse alleles) project 
(http://www.creline.org/) and EUCOMMTOOLS are planning 
to maximize the utility of the conditional IKMC resources by 
integrating and generating conditional Cre mouse lines (12). 

The advanced version of Cre mouse lines 

These days, the caveats of the previously generated Cre trans- 
genic mouse lines including Cre expressions in off-target tis- 
sues, effect of the parent-of-origin, and inconsistent Cre re- 
combination among littermates are being identified and ad- 
dressed (13). For example, the expressions of Cre transgenes 
do not exactly coincide with those of the cognate genes. 
£mx7-Cre transgenic mice were generated using the 1 1-kb se- 
quence located upstream of the Emxl gene, but the Cre-medi- 
ated recombination did not match the endogenous expression 
pattern of Emx1 gene (14). That is caused by the use of a parti- 
al, but not the entire region of the promoter, thus discarding 
potentially important regulatory elements (e.g., enhancers) re- 
quired for the intended Cre expression. Furthermore, the ex- 



pression of Cre transgene may be affected by the genomic en- 
vironment near the transgene integration site. 

The knock-in strategy will make the Cre expression pattern 
much more similar to the cognate gene. However, it may be al- 
so problematic. For example, conventional Cre knock-in in- 
evitably mutates one wild-type allele, and thus haplo-in- 
sufficient phenotypes are possibly induced. However, Chapman 
et a/, described the novel strategy that can potentially solve the 
drawback (15). They integrated a "self-cleaving" E2A peptide se- 
quence-fused Cre-ER T2 coding cassette into the stop codon of 
the endogenous surfactant C (SPC) gene. This strategy maintains 
endogenous SPC, and, at the same time, the same numbers of 
Cre-ER T2 protein are produced. It will be very helpful if these ad- 
vantages are reflected for the future generation of all Cre mouse 
lines. 

ENGINEERED NUCLEASES 

Until recently, mouse ES cells were essential for custom-de- 
signed genetic modifications of the mouse genome. Now, en- 
gineered nucleases including zinc-finger nucleases (ZFNs) or 
Transcription Activator-Like Effector (TALE) Nucleases (TALENs) 
are rapidly evolving. Injection of ZFN or TALEN mRNA tran- 
scribed in vitro into one-cell stage mouse embryos allows rapid 
creation of genetically-engineered mouse (GEM) models with- 
out generating gene-targeted ES cell clones. Here, we discuss 
the basic characteristics and applications of engineered nucle- 
ases in regard to genetic engineering in vivo. 

Sequence-specific recognition and restriction by ZFNs and 
TALENs 

ZFNs and TALENs have been produced by connecting se- 
quence-specific zinc finger arrays or TALEs for chosen DNA 
sequences to the cleavage domain of Fok I endonuclease 
through a linker (Fig. 2A, B). The modular structure of zinc fin- 
gers and TALEs support the generation of custom-designed 
DNA-binding domains for chosen DNA sequences with an ex- 
ceptional specificity. After the pioneering work done by 
Bibikova et al. (16), zinc-finger domains have been intensively 
studied, and are being used widely for the genome mod- 
ification (17-20). Individual zinc finger motifs are capable of 
binding triplets of DNA sequence, and thus zinc-finger pro- 
teins with adequate specificity can be produced through the 
modular assembly of individual zinc finger motifs (21, 22); 
e.g., six zinc-finger motifs recognizing a specific 18-bp se- 
quence is sufficient to pick up a single genomic locus in the 
mouse genome (23). 

TALEs are transcription factors produced by bacteria] plant 
pathogen, Xanthomonas, that enter into the nucleus and di- 
rectly activate the expression of host genes (24). Boch et al. 
uncovered the cipher of TALEs responsible for their se- 
quence-specific recognition of the target sequences (25). The 
DNA-binding domains of TALE are typically composed of re- 
peats of 34 highly conserved amino-acid residues with the ex- 
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Fig. 2. The principles of ZFN- or TALEN-mediated gene targeting. (A) Zinc finger (ZF) motifs and (B) TALE modules are designed to detect 
specific sequences in an opposite direction. Previously developed, heterogeneous nuclease domains of Fokl endonuclease can only form a 
heterodimer that is essential to exert endonuclease activity (32). (C) When connected to ZFs or TALEs through a linker, each pair of ZFNs 
or TALENs induces a double-strand break (DSB) at the spacer region between the binding sequences. DSBs are repaired through homolo- 
gous recombination (HR) or non-homologous end joining (NHEJ). (D) If a donor DNA containing an additional sequence or modifications 
of the endogenous sequence, HR-mediated gene conversion can be achieved. (E) Without the template for HR-mediated DNA repair, DSB 
will be repaired through the error-prone NHEJ pathway, frequently resulting in deletion, insertion, and substitution of the original 
sequence. 



ception of the 12th and 13th amino acid. Each repeat recog- 
nizes one base pair in the target DNA, and these two variable 
amino-acid residues dictate the specificity of each repeat (e.g., 
HD, NG, Nl, and NN for C, T, A, and G, respectively). 
Exploiting this modular interaction, Christian et al. created the 
first TALEN, and validated its activity on synthetic targets (26). 
Subsequent studies proved the versatile nuclease activity of 
TALENs against specific genomic loci in yeasts (27), and in 
cultured human cells (28). Further, the minimal DNA-binding 
domain of TALE proteins were determined, and their targeting 
specificity was validated (28, 29). 

The dimerization of Fokl nuclease domains is required to 
cleave the target DNA (30). Normally, a pair of ZFNs or 
TALENs is designed to target a single locus, and the target se- 
quence is placed between their binding sequences. Therefore, 
conventional ZFNs and TALENs act as heterodimers to induce 
double strand breaks (DSBs) at their target loci (Fig. 2C). This 
suggests that homodimer formation of engineered nucleases 
may induce DSBs at off-target sites, i.e., DNA sequences other 
than the intended target sequence (31). To minimize the ho- 
modimer formation, Miller et al. developed two heterodimeric 
Fokl cleavage domain variants by engineering the interface 
mediating their interaction (32). This structural modification 
significantly decreases homodimer formation and the potential 
toxicity associated with off-target effects, and generally contrib- 
utes to improving the specificity of ZFNs and TALENs. 



Due to their relatively long history, ZFNs have been ex- 
tensively studied not only to improve the targeting specificity, 
but also to minimize the cytotoxicity associated with off-target 
effects. However, there still seems to be debates on how to ef- 
ficiently design ZFNs with sufficient specificities (33-35). As 
well as the simple cipher of TALE binding to DNA (25), 
TALENs are rapidly evolving through absorbing the systemic 
advantages achieved from studying ZFNs. 

Genome editing by ZFNs and TALENs 

The two subunits of a ZFN and TALEN pair bind to two ad- 
jacent sequences separated by a short spacer sequence in an 
inverted orientation (Fig. 2A, B) and the dimerized Fokl nucle- 
ase domain introduces DSBs at the spacer region (Fig. 2C). 
These DSBs, inducing deleterious effects on the genomic in- 
tegrity, are generally solved by two highly conserved DNA re- 
pair pathways (Fig. 2D, E): homologous recombination (HR) 
and non-homologous end-joining (NHEJ). These innate re- 
sponses of the cells against DSBs are enabling us to edit the 
genomes of diverse species using ZFNs and TALENs. 
Especially, after the first successful application in Drosophila 
(16), a variety of mutant organisms, including mice, has been 
created using ZFNs (18). 

DNA repair by NHEJ is a rapid and efficient process, but is 
error-prone (36). Previous studies indicate that ZFN-induced 
DSBs are prevalently repaired by NHEJ, and that null muta- 
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tions induced by small insertions, deletions, and even sub- 
stitutions frequently appear at the targeted locus (17). In con- 
trast, HR is quite a rare event, but is essential for gene targeting 
in ES cells (37). However, when a DSB is introduced, the effi- 
ciency of HR can be significantly elevated in mammalian cells 
(38). Similarly, ZFN-induced DSBs effectively activates HR in 
human cells (39). Along this line, gene-targeted mice were 
generated by directly injecting ZFN mRNAs and the targeting 
construct into the pronuclear mouse embryos (40). Further- 
more, instead of the plasmid-based gene-targeting vector that 
is laborious and time-consuming to construct, a single strand- 
ed synthetic oligonucleotide is also successfully used as a tem- 
plate for ZFN-induced homology-directed repair in mouse em- 
bryos of the pronuclear stage (41). These results indicate that 
ES cells are not an essential reagent for the targeted mod- 
ification of the mouse genome any more. 

Despite the short history, TALENs have rapidly developed as 
valuable tools for genome editing (42), and are being applied 
for generating genetically-engineered organisms of diverse 
species. In fact, TALENs were used to target genes in C. e/e- 
gans, zebrafish, rats, cows, and pigs (43-47). In general, the ef- 
ficiency of the genome modification by TALENs seems to ap- 
proach the levels induced by ZFNs (28). In addition, much evi- 
dence points out the advantages of TALENs over ZNFs. For ex- 
ample, as compared to the corresponding ZFNs in human 
cells, even though TALENs were similar in gene disruption ac- 
tivities, they exhibited significantly reduced nuclease-asso- 
ciated cytotoxic activity, and revealed only minimal off-target 
activity at the highly homologous sequence (29). Consistently, 
compared to the efficiency obtained with ZFNs (48), TALEN 
showed less toxicity in generating TALEN-mediated knockout 
rats (47). These beneficial aspects, in conjunction with the sim- 
ple decipher (25), may lead to the prevalent use of TALENs in 
directly generating GEM models. 

Replacement of the use of ES cells 

It is plausible to expect that engineered nucleases will consid- 
erably replace the use of ES cells in modifying the mouse 
genome. In addition to generating null alleles, engineered nu- 
cleases will be an effective tool for the knock-in strategies aim- 
ing to substitute amino acid(s) or to insert a short stretch of 
nucleotides. For example, to study endogenous transporter 
trafficking of the plasma membrane dopamine (DA) transporter 
(DAT), Rao et a/, established a knock-in mouse line harboring 
a hemagglutinin (HA)-epitope tag at the second extracellular 
loop of mouse DAT via the homologous recombination in ES 
cells (49). As the use of engineered nucleases enables the se- 
quence-specific modification of the mouse genome in embryos 
(41, 50), it could be directly achieved by employing an effi- 
cient engineered nuclease and donor construct or oligonucleo- 
tide by avoiding the laborious and time-consuming steps of es- 
tablishing targeted ES cells, generation of chimeras, and 
germ-line transmission. According to Cui et a/., even an ex- 
pression cassette can be inserted by the aid of engineered nu- 



cleases (50). It may provide a more expanded potential for 
generating diverse mouse models, and the use of engineered 
nucleases will make it possible to avoid the use of ES cells. 
Therefore, the mouse generation procedure should be sim- 
plified by the use of engineered nucleases. 

The potential use of engineered nucleases for generating Cre 
mouse lines 

The need for appropriate Cre mouse lines is enormously in- 
creasing, proportional to the expansion of publically available 
conditional KO mouse lines. As mentioned above, one of the 
most advanced versions of Cre mouse lines was produced by 
inserting a self-cleaving peptide-fused Cre-ER T2 -encoding se- 
quence into a stop codon of a gene (15). This strategy is com- 
patible to the use of engineered nucleases in fertilized mouse 
eggs, and thus will be beneficial for the efficient and massive 
generation of Cre mouse lines. 

CONCLUSIONS 

The international efforts through the IKMC are expanding the 
mutant mouse resources, and are promoting public access to 
GEM models. In addition, the current development and versa- 
tility of engineered nucleases are pushing the generation of 
novel mouse models by circumventing the use of ES cells. 
Furthermore, these technologies will synergize to accelerate 
the establishment and use of GEM models as a more general- 
ized tool for studying in vivo gene functions. 
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